Maximum Separation Partial Least Squares (mspls): a New Method for Classification in Microarray Experiment
نویسندگان
چکیده
The purpose of the paper is to propose a new method for classification. Our MSPLS method was deduced from the classic Partial Least Squares (PLS) algorithm. In this method we applied the Maximum Separation Criterion. On the basis of the approach we are able to find such weight vectors that the dispersion between the classes is maximal and the dispersion within the classes is minimal. In order to compare the performance of classifier we used the following types of dataset – biological and simulated. Error rates and confidence intervals were estimated by the jackknife method.
منابع مشابه
Feature Selection in Tumor Classification Using Microarray Gene Expression Data
Feature selection is the process of choosing a subset of the original predictive variables through the elimination of redundant and uninformative representatives. An example of importance is the analysis of gene expression data from DNA microarray hybridization experiments. The data obtained from the experiments usually contain a few samples each with expression levels of a large number of gene...
متن کاملBuilding Classification Models from Microarray Data with Tree-Based Classification Algorithms
Building classification models plays an important role in DNA mircroarray data analyses. An essential feature of DNA microarray data sets is that the number of input variables (genes) is far greater than the number of samples. As such, most classification schemes employ variable selection or feature selection methods to pre-process DNA microarray data. This paper investigates various aspects of...
متن کاملClassification using partial least squares with penalized logistic regression
MOTIVATION One important aspect of data-mining of microarray data is to discover the molecular variation among cancers. In microarray studies, the number n of samples is relatively small compared to the number p of genes per sample (usually in thousands). It is known that standard statistical methods in classification are efficient (i.e. in the present case, yield successful classifiers) partic...
متن کاملFeature Selection and Classification of Microarray Gene Expression Data of Ovarian Carcinoma Patients using Weighted Voting Support Vector Machine
We can reach by DNA microarray gene expression to such wealth of information with thousands of variables (genes). Analysis of this information can show genetic reasons of disease and tumor differences. In this study we try to reduce high-dimensional data by statistical method to select valuable genes with high impact as biomarkers and then classify ovarian tumor based on gene expression data of...
متن کاملStability of Gene Selection Methods for Multiclass Clssification
A big problem in applying DNA microarrays for classification is dimension of the dataset. Recently we proposed a gene selection method based on Partial Least Squares (PLS) for searching best genes for classification. The new idea is to use PLS not only as multiclass approach, but to construct more binary selections that use one versus rest and one versus one approaches. Ranked gene lists are hi...
متن کامل